Learning Reliable Classifiers From Small or Incomplete Data Sets: The Naive Credal Classifier 2

نویسندگان

  • Giorgio Corani
  • Marco Zaffalon
چکیده

In this paper, the naive credal classifier, which is a set-valued counterpart of naive Bayes, is extended to a general and flexible treatment of incomplete data, yielding a new classifier called naive credal classifier 2 (NCC2). The new classifier delivers classifications that are reliable even in the presence of small sample sizes and missing values. Extensive empirical evaluations show that, by issuing set-valued classifications, NCC2 is able to isolate and properly deal with instances that are hard to classify (on which naive Bayes accuracy drops considerably), and to perform as well as naive Bayes on the other instances. The experiments point to a general problem: they show that with missing values, empirical evaluations may not reliably estimate the accuracy of a traditional classifier, such as naive Bayes. This phenomenon adds even more value to the robust approach to classification implemented by NCC2.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reliable diagnoses of dementia by the naive credal classifier inferred from incomplete cognitive data

Dementia is a serious personal, medical and social problem. Recent research indicates early and accurate diagnoses as the key to effectively cope with it. No definitive cure is available but in some cases when the impairment is still mild the disease can be contained. This paper describes a diagnostic tool that jointly uses the naive credal classifier and the most widely used computerized syste...

متن کامل

Tree-Based Credal Networks for Classification

Bayesian networks are models for uncertain reasoning which are achieving a growing importance also for the data mining task of classification. Credal networks extend Bayesian nets to sets of distributions, or credal sets. This paper extends a state-of-the-art Bayesian net for classification, called tree-augmented naive Bayes classifier, to credal sets originated from probability intervals. This...

متن کامل

JNCC2: An extension of naive Bayes classifier suited for small and incomplete data sets

JNCC2 implements the Naive Credal Classifier 2 (NCC2), i.e., an extension of naive Bayes to imprecise probabilities, designed to return robust classification even on small and/or incomplete data sets, which is often the case in environmental case studies.

متن کامل

Naive Credal Classifier 2: a robust approach to classification for small and incomplete data sets

Naive Credal Classifier, which is an imprecise-probability counterpart of Naive Bayes, is rigorously extended to a very general and flexible treatment of incomplete data, yielding a new classifier called Naive Credal Classifier 2 (NCC2). The new classifier delivers classifications that are robust to the presence of small sample sizes and missing values. In particular, some empirical evaluations...

متن کامل

Lazy Credal Classifier and how to compare credal classifiers

This poster carries out two main contributions: (a) a lazy (or local) version of naive credal classifier (NCC) that we call lazy naive credal classifier (LNCC); (b) two metrics to compare credal classifiers. NCC [1] has extended naive Bayes (NB) to imprecise probabilities, by modeling prior ignorance via the Imprecise Dirichlet Model; the classification is eventually issued by returning the non...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of Machine Learning Research

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2008